We use information theory to derive fundamental limits on the capacity to calibrate
next-generation radio interferometers, and measure parameters of point sources for instrument
calibration, point source subtraction, and data deconvolution. We demonstrate the implications of
these fundamental limits, with particular reference to estimation of the 21 cm Epoch of Reionization
power spectrum with next-generation low-frequency instruments (e.g., the Murchison Widefield
Array - MWA, Precision Array for Probing the Epoch of Reionization - PAPER), where short
time scale instrumental calibration is required due to the impact of the ionosphere on the signal
wavefront. Finally, we explore the optimal point source precision available by using a combination
of current and prior information. Estimation schemes that incorporate prior information may
be advantageous when the measurement precision is comparable to the characteristic refraction
scale of the ionosphere.

1. Introduction

The suite of radio interferometers currently under construction and in planning phases has the
potential to deliver answers to exciting scientific questions, such as describing the properties and
evolution of early universe structures, and probing variable radio sources. With this new era
of radio interferometry comes the development of new instruments and technologies, as well as
the challenges inherent in advancing a field and gaining new knowledge. For most programs, the
sensitivity and resolution limits of our instruments will be pushed, and there will be an increasing
reliance on temporal observing modes and the low-frequency domain. We will see a shift from
narrow-field, long integration, few receiving-element science, to wide-field, snapshot observing
with large element arrays, culminating in the ultimate multi-science instrument: the Square
Kilometre Array (SKA). In addition to the engineering challenges inherent in the design and execution
of the SKA, there are accompanying questions to be answered regarding the data limitations for
such an instrument, its calibratability and realisable dynamic range. These questions are being
increasingly addressed in the literature, and suggest a complicated path forward for the optimal
design of the instrument and our data analysis methodology [13].

Key science goals of the SKA, and pathfinder instruments, demand precision quantitative radio
astronomy, where an understanding of the underlying properties of the data is crucial for robust and
unbiased science results. Such goals include the detection of new classes of fast transient sources,
and detection and estimation of the neutral hydrogen signal from the Epoch of Reionization (EoR).
These goals are shared by many instruments currently in the development, building and
commissioning phases, such as the Murchison Widefield Array (MWA), Precision Array for Probing the
Epoch of Reionization (PAPER), Low Frequency Array (LOFAR), and the Long Wavelength Array
(LWA). These instruments and projects operate at low frequencies (100-200 MHz),
where the ionosphere plays an important role in the shape of the signal wavefront.

The impact of the ionosphere on the measured correlations (visibilities) ranges from zero
(constant phase shift between antennas) to de-focussing, depending on the extent of the array, the
frequency, and the characteristic size scale of ionospheric disturbances. Cohen & Röttgering
studied the differential refraction (position change relative to other sources in the field) for Very
Large Array (VLA) fields at 74 MHz, yielding a source shift of ~100 arcseconds at a separation of
25 degrees under normal ionospheric conditions.

Wide-field low frequency instruments, such as the MWA, PAPER and LWA, aim to use known 
bright point sources as calibrators to model the instantaneous effect of the ionosphere on the wave- 
front. In this work we consider an observational scheme where the calibration is performed in 
real-time, necessitating a measurement of the instrument and sky response on eight to ten second 
timescales. These solutions are applied in real-time to the measured data (visibilities). 

The precision with which the sky position of calibration sources may be estimated with limited
data determines the degree to which an instrument is calibratable, and consequently the quality of
scientifically-relevant metrics. We begin by deriving the theoretical precision with which point
source parameters can be estimated from a given visibility dataset, and propagate these errors to
additional noise terms in the visibilities. As an example scientific application of this residual signal,
we then propagate the errors to a metric of interest for the statistical estimation of the 21 cm EoR
signal: the angular power spectrum. We then discuss estimation whereby prior information is
balanced with information from the current dataset to estimate parameters, and demonstrate the
point source position precision that may be achieved by an optimal estimator.



2. Point source estimation limits 

2.1 The Cramér-Rao bound (CRB)

We use the Cramér-Rao lower bound (CRB) on the precision of parameter estimates. The
CRB gives the precision with which a minimum-variance unbiased estimator could estimate
a parameter value, using the information content of the dataset. It is computed as the square-root
of the corresponding diagonal element of the inverse of the Fisher information matrix (FIM). The
(i,j)th entry of the FIM for a vector θ of unknown parameters is given by:

$$ [I(\theta)]_{ij} = -E\left[ \frac{\partial^2 \log L(x;\theta)}{\partial\theta_i \, \partial\theta_j} \right], \tag{2.1} $$

where L denotes the likelihood function describing the likelihood of measuring a dataset, x, for a
given parameter set, θ. The CRB places a fundamental lower limit on the measurement uncertainty
of any parameter. In this work it will be used to gain an understanding of the fundamental limits of
point source subtraction, and how these impact EoR estimation.
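
As a concrete illustration of the recipe above (build the FIM, invert it, take the square root of the diagonal), the following minimal sketch evaluates the CRB for a toy problem. It assumes real-valued data with independent Gaussian noise of known rms `sigma`, for which the FIM reduces to the product of the model Jacobian with itself divided by the noise variance. The straight-line model and the function names are hypothetical and are not taken from this work.

```python
import numpy as np

def fisher_matrix_gaussian(jacobian, sigma):
    """FIM for real data x = mu(theta) + noise, with noise ~ N(0, sigma^2 I).

    jacobian[k, i] = d mu_k / d theta_i, evaluated at the true parameters.
    """
    jacobian = np.asarray(jacobian, dtype=float)
    return jacobian.T @ jacobian / sigma**2

def cramer_rao_bound(fim):
    """Square root of the diagonal of the inverse Fisher information matrix."""
    return np.sqrt(np.diag(np.linalg.inv(fim)))

# Toy model (hypothetical): a straight line mu_k = a + b * t_k
t = np.linspace(-1.0, 1.0, 101)
jac = np.column_stack([np.ones_like(t), t])  # derivatives w.r.t. a and b
sigma = 0.5                                  # noise rms per sample
crb_a, crb_b = cramer_rao_bound(fisher_matrix_gaussian(jac, sigma))
print(f"CRB on offset a: {crb_a:.4f}, CRB on slope b: {crb_b:.4f}")
```

Because the sample times are symmetric about zero, the two parameters decouple and the bound on the offset reduces to the familiar sigma divided by the square root of the number of samples.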

2.1.1 Incorporating prior information - the hybrid CRB 

If one uses some prior information about the value of a parameter, in addition to the
information contained within the current dataset, one can use a hybrid CRB (HCRLB) to obtain a measure
of the maximum estimation precision. In this case, the maximal precision corresponds to the
minimum mean squared error, and may be biased by the use of inaccurate prior information. We extend
the formalism to incorporate the parameter of interest in the likelihood function, treating the
parameter as random rather than deterministic. Using Bayes' rule, and considering the joint likelihood
for the data and the parameter with prior information, θ:

$$ L(x,\theta) = L(x|\theta) \, L(\theta), \tag{2.2} $$

where L(x|θ) is the likelihood of the data conditioned on the parameter value. The FIM is:

$$ [I]_{ij} = -E\left[ \frac{\partial^2 \log L(x|\theta)}{\partial\theta_i \, \partial\theta_j} \right] - E\left[ \frac{\partial^2 \log L(\theta)}{\partial\theta_i \, \partial\theta_j} \right]. \tag{2.3} $$

The final term in equation 2.3 contains the prior information available about the value of parameter
θ. For a Gaussian-distributed prior, L(θ), the Fisher information contains an additional
term corresponding to the prior information, given by $I_{\theta\theta} = 1/\sigma^2$, where σ² is the variance of the
Gaussian distribution. A smaller variance corresponds to more precise prior information, and contributes
more information to the Fisher matrix than a broad distribution. In reference to this work, we are
considering the effect of the ionosphere to produce a Gaussian-distributed shift in the position of a
source, with a characteristic refraction scale dependent on the level of ionospheric activity on the
timescale of interest (σ).
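
For a single parameter, the effect of the prior term can be sketched in a few lines. The example below assumes a scalar position parameter, so that the data and prior contributions to the Fisher information simply add; the function name and the example numbers are hypothetical and are not results from this work.

```python
import numpy as np

def hybrid_precision(data_precision, prior_sigma):
    """Best-case rms error when data and a Gaussian prior are combined.

    data_precision : CRB from the current dataset alone
    prior_sigma    : standard deviation of the Gaussian prior (same units)
    Assumes a single scalar parameter, so the two information terms add.
    """
    data_info = 1.0 / np.asarray(data_precision, dtype=float) ** 2
    prior_info = 1.0 / prior_sigma**2
    return 1.0 / np.sqrt(data_info + prior_info)

# Hypothetical data-only precisions of 1, 10 and 100 arcsec, with a 10 arcsec prior
combined = hybrid_precision([1.0, 10.0, 100.0], prior_sigma=10.0)
print(combined)  # roughly 0.995, 7.07 and 9.95 arcsec
```

When the data are much more precise than the prior, the prior contributes almost nothing; when the data are much less precise, the achievable error saturates at the prior width.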

Parameter              Value (MWA)    Value (PAPER)
N_ant                  128            64
ν_0                    150 MHz        150 MHz
Bandwidth              30.72 MHz      70.0 MHz
Δν                     125 kHz        125 kHz
Field-of-view          30°            60°
Calibrators (>1 Jy)    392            1579
T_sys                  440 K          440 K
Δt                     8 s            8 s
t_tot                  300 h          720 h
                       3000 m         260 m

Table 1: Parameters used for the primary instrument design of the MWA and PAPER 64-dipole instruments.



2.2 Residual signal in visibilities 



The signal in each visibility is the linear combination of signals from each calibrator, and is
given by:

$$ \tilde{s}[f,n] = \sum_{i=1}^{N_c} V_i(u_{fn}, v_{fn}) = \sum_{i=1}^{N_c} B_i(l_i, m_i) \exp\left[ -2\pi \mathrm{i} \, (u_{fn} l_i + v_{fn} m_i) \right], \tag{2.4} $$

where N_c is the total number of calibrators, each described by a source strength, B_i, located at sky
position, (l_i, m_i), for a baseline, n, and frequency channel, f. The signal is embedded within white
Gaussian thermal noise, with diagonal covariance matrix, C = σ²I, where σ is set by the radiometer
noise. Using the full dataset, and no prior information, the minimum uncertainty in the parameter
estimates for calibrator, i, and their non-zero covariances, are given by [12]:

(2.6)
(2.7)
(2.8)
(2.9)

where the expressions involve double sums over baselines, n, and frequency channels, f, of products
of the baseline coordinates, such as $\sum_{n}\sum_{f} u_{fn} v_{fn}$.
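
The bound for a single calibrator can also be evaluated numerically from the visibility model of equation 2.4. The sketch below is not the closed-form result referenced above: it builds the Fisher matrix for the parameters (l, m, B) directly from the model derivatives, assuming that σ is the noise rms per real and imaginary visibility component. The uv sampling, noise level and function name are hypothetical.

```python
import numpy as np

def point_source_crb(u, v, flux, l, m, sigma):
    """Numerical CRB on (l, m, flux) for one point source.

    u, v  : baseline coordinates in wavelengths, one entry per (f, n) sample
    flux  : source strength B (Jy); l, m : sky position (direction cosines)
    sigma : noise rms per real/imaginary visibility component (Jy), an assumption
    """
    u = np.ravel(u).astype(float)
    v = np.ravel(v).astype(float)
    phase = np.exp(-2j * np.pi * (u * l + v * m))
    # Derivatives of V = B exp[-2 pi i (u l + v m)] with respect to l, m and B
    jac = np.column_stack([
        -2j * np.pi * u * flux * phase,
        -2j * np.pi * v * flux * phase,
        phase,
    ])
    fim = np.real(jac.conj().T @ jac) / sigma**2
    return np.sqrt(np.diag(np.linalg.inv(fim)))

# Hypothetical uv sampling: random baselines out to 1000 wavelengths
rng = np.random.default_rng(0)
u = rng.uniform(-1000.0, 1000.0, size=2000)
v = rng.uniform(-1000.0, 1000.0, size=2000)
for flux in (1.0, 10.0):
    dl, dm, dB = point_source_crb(u, v, flux, l=0.01, m=-0.02, sigma=5.0)
    print(f"B = {flux:5.1f} Jy: dl = {dl:.3e}, dm = {dm:.3e}, dB = {dB:.3f} Jy")
```

Under these assumptions the position bounds scale as 1/B, while the bound on the source strength depends only on the noise level and the number of samples.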

We perform this analysis for two upcoming statistical EoR experiments: the MWA and PAPER.
Each array is modelled with two antenna configurations, corresponding to minimally-redundant
and maximally-redundant uv sampling. Figure 1 displays the antenna configurations for the four
arrays considered, and Table 1 displays the experiment parameters. Figure 2(a) plots the optimal
point source position precision (Δl) as a function of calibrator signal strength (Jy) for these four
instruments. The expected linear dependence of precision on signal strength is observed (the position
uncertainty scales inversely with signal strength). The inner core + outer ring structure of the MWA
degrades its performance slightly compared with the uniform array. The short baselines and fewer
antennas of PAPER degrade its ability to localize sources compared with the MWA, although its wider
instantaneous bandwidth partially offsets this effect.

Figure 1: Antenna configurations for the MWA and PAPER considered in this work. Recoverable panel
labels: (b) MWA; (c) actual 64-dipole minimum-redundancy (PAPER); (d) potential 64-dipole
maximum-redundancy (PAPER).



2.3 Propagation of errors into interferometric visibilities 

The uncertainties in signal parameters for each calibrator are propagated into uncertainty in 
the measured visibilities, in addition to the statistical thermal noise. The treatment provides a full 
covariant analysis, including any correlations between visibilities due to residual noise power. For 
a given visibility and a single calibrator, the variance is given by: 

(2.10)
(2.11)
(2.12)

where f_1, f_2, f_3 are functions of the baseline lengths (array geometry). The residual error in each
visibility due to each calibrator is independent of the calibrator signal strength. Therefore, the
impact on the visibilities of subtracting each calibrator has the same magnitude for all calibrators
[but different distributions across visibilities; the covariances also depend on the source positions,
(l_i, m_i)]. This result deviates from the assumptions of previous work, which assumed that the source
position error was independent of signal strength, and therefore that the residual error was greatest for
the strongest calibrators.

Figure 2: Optimal source sky position precision (minimum mean squared error) for (a) the dataset alone,
and for characteristic ionospheric refraction scales of (b) 60" and (c) 10".
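
The independence of the residual from calibrator strength can be checked with a first-order error propagation. The sketch below is an illustration under stated assumptions rather than the analytic expressions of this section: it takes the CRB covariance of (l, m, B) for one source, with σ the noise rms per real and imaginary component, and maps it through the derivatives of the visibility model. The uv sampling and function names are hypothetical.

```python
import numpy as np

def jacobian(u, v, flux, l, m):
    """Derivatives of V = B exp[-2 pi i (u l + v m)] w.r.t. (l, m, B)."""
    phase = np.exp(-2j * np.pi * (u * l + v * m))
    return np.column_stack([
        -2j * np.pi * u * flux * phase,
        -2j * np.pi * v * flux * phase,
        phase,
    ])

def residual_variance(u, v, flux, l, m, sigma):
    """First-order variance of the subtraction residual in each visibility."""
    jac = jacobian(u, v, flux, l, m)
    # Parameter covariance at the Cramer-Rao bound (inverse Fisher matrix)
    cov = np.linalg.inv(np.real(jac.conj().T @ jac) / sigma**2)
    # var_k = J_k cov J_k^H, evaluated for every visibility k
    return np.real(np.einsum("ki,ij,kj->k", jac, cov, jac.conj()))

# Hypothetical uv sampling and noise level
rng = np.random.default_rng(1)
u = rng.uniform(-1000.0, 1000.0, size=2000)
v = rng.uniform(-1000.0, 1000.0, size=2000)
sigma = 5.0
var_weak = residual_variance(u, v, 1.0, 0.01, -0.02, sigma)
var_bright = residual_variance(u, v, 50.0, 0.01, -0.02, sigma)
print("max fractional difference:", np.max(np.abs(var_weak / var_bright - 1.0)))
print("total residual variance / sigma^2:", var_weak.sum() / sigma**2)
```

In this sketch the per-visibility residual variance is the same for the weak and the bright source, its sum over visibilities equals the number of fitted parameters times σ², and it is largest on the longest baselines.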



3. Impact on EoR statistical estimation 

One of the primary tools of a statistical measurement of the EoR is the angular power spectrum,
which quantifies the signal power on a given angular scale. The angular power spectrum is defined
as:

$$ C_l = \frac{\sum_{(uv) \in l} N_{uv} |V_{uv}|^2}{\sum_{(uv) \in l} N_{uv}}, \tag{3.1} $$

where N_uv is the number of visibilities contributing to a given (uv) cell, and the sums are over the
(uv)-cells contributing to that l-mode (l = 2π|u|).
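
A minimal sketch of this estimator is given below, assuming the visibilities have already been gridded into (uv) cells and that each cell is assigned to an l-bin through l = 2π|u|. The function name, bin edges and toy inputs are hypothetical.

```python
import numpy as np

def angular_power_spectrum(u, v, vis, n_vis, l_edges):
    """N_uv-weighted mean of |V_uv|^2 over the uv cells falling in each l bin.

    u, v    : cell centres in wavelengths
    vis     : gridded (complex) visibility in each cell
    n_vis   : number of visibilities contributing to each cell
    l_edges : edges of the l bins
    """
    u, v = np.asarray(u, dtype=float), np.asarray(v, dtype=float)
    n_vis = np.asarray(n_vis, dtype=float)
    ell = 2.0 * np.pi * np.hypot(u, v)
    power = n_vis * np.abs(np.asarray(vis)) ** 2
    num, _ = np.histogram(ell, bins=l_edges, weights=power)
    den, _ = np.histogram(ell, bins=l_edges, weights=n_vis)
    with np.errstate(invalid="ignore", divide="ignore"):
        return np.where(den > 0, num / den, np.nan)  # NaN marks empty bins

# Toy input: three uv cells, two l bins
c_l = angular_power_spectrum(
    u=[10.0, 20.0, 100.0], v=[0.0, 0.0, 0.0],
    vis=[2.0, 2.0j, 1.0], n_vis=[1, 3, 2],
    l_edges=[0.0, 200.0, 1000.0],
)
print(c_l)
```

Weighting by N_uv means that well-sampled cells dominate the estimate in each bin, which is the behaviour implied by the definition above.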

We propagate the uncertainties in the visibilities due to the thermal noise and residual point
source signal to the angular power spectrum. Figure 3 displays the ratio of uncertainty in residual
signal to thermal noise power for the four configurations. In all cases the thermal noise uncertainty
in power exceeds the power from the point source subtraction. One can see the higher angular
modes suffer more severely from point source subtraction, consistent with the additional noise
in the long baseline visibilities (equation 2.12). These figures also highlight the differences
between the array configurations; the longer baselines of the MWA sample higher l-modes, and the
larger number of receiving elements yields improved sensitivity. Conversely, the larger
instantaneous bandwidth of the PAPER array improves point source precision, offsetting the sensitivity
degradation. The larger field-of-view of PAPER corresponds to a larger number of peeled sources,
increasing the contribution from the residual signal compared with the thermal noise.

Figure 3: Ratio of uncertainties in power for residual signal to thermal noise.



4. Optimal information balance: data versus priors 

Figure 2(a) demonstrates that the measurable sky position precision can exceed one arcminute
for weak sources and low-sensitivity arrays. In this regime, the ionospheric refraction on short
timescales (8-10 seconds) may be comparable to, or smaller than, the precision available from
the dataset alone. An optimal estimator would balance the information available from the current
dataset and previous timesteps to obtain the most accurate and precise position estimate (accuracy
refers to parameter bias, and one must be careful not to bias the estimate by improper use of prior
information). Using the HCRLB formalism derived in Section 2.1.1, we compute the precision
available with an optimal estimator for two example characteristic refraction scales (60", 10"),
and display the results in Figure 2(b, c). It is evident that for weak signals, the prior information
dominates the Fisher matrix, while estimation for bright sources relies solely on the information
available in the current dataset. These results suggest that in conditions of low ionospheric activity,
there are substantial advantages available by considering a hybrid estimation scheme.

5. Summary

Point source estimation on short timescales is a primary observing task for upcoming
low-frequency instruments, both for fundamental calibration and removal of foregrounds for statistical
EoR experiments. Precise and accurate estimation is therefore critical to the quality and fidelity of
all downstream quantitative science results. In this work we considered the optimal estimation of
bright point source sky positions, and studied the impact of limited dataset information on the noise
level in visibilities, and on EoR angular power spectrum estimation. We found that the magnitude
of the residual point source signal is expected to be below the thermal noise uncertainty, and
therefore not a limiting factor for EoR estimation. We do find, however, that the residual signal is
structured differently to the thermal noise (it is coloured), and should be considered in the data
analysis.

Incorporation of prior information about point source position can improve the precision with
which positions can be estimated. For weak sources and compact, low-sensitivity arrays, the
characteristic ionospheric refraction scale may be comparable to the precision available from the data,
and an optimal estimator would use a balance of prior and current information.